An Effective Wrapper Architecture to Heterogeneous Data Source

Authors

  • Hongzhi Wang
  • Jianzhong Li
  • Zhenying He
Abstract

In this paper, we focus on the problem, in information integration systems, of obtaining data from heterogeneous data sources accurately and effectively. XML is used as the data exchange format of the wrapper. We design the wrapper architecture around the conversion and management of views, which serve as the bridge from the global schema to the local schemas of the various data sources. Our wrapper has two main subsystems: a data extract subsystem and a query executor subsystem. The former loads data into the mediator's cache when the detected changes exceed a threshold, and the latter answers queries from the mediator. The architecture adapts to data and schema changes in the data source and can answer the mediator's queries effectively. Because the wrapper may run in an environment outside our control, its processing should be simple, its own storage should be as small as possible, and the storage of the data source can be used instead. The query rewrite, view management, query merge, result wrap, and schema change detect modules are discussed in detail. The behavior of the wrapper during query processing is illustrated with a running example. The security strategy, especially for the case in which the wrapper runs at an autonomous data source, is also introduced.
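
As a rough illustration of the two subsystems described in the abstract, the following minimal Python sketch may help. It is not the authors' implementation: the class name `Wrapper`, the dictionary-based view (global field name to local field name), the in-memory list standing in for the data source, and the change-counting rule are all assumptions made for the example. The paper's actual query rewrite, query merge, and schema change detect modules are not reproduced here.

```python
import xml.etree.ElementTree as ET


class Wrapper:
    """Toy wrapper; all names and the threshold rule are hypothetical."""

    def __init__(self, source_rows, view, change_threshold):
        self.source_rows = source_rows            # stand-in for the local data source
        self.view = view                          # global field name -> local field name
        self.change_threshold = change_threshold  # assumed: a count of changed rows
        self.pending_changes = 0

    # Query executor subsystem (simplified)
    def rewrite(self, global_fields):
        """Translate global-schema field names into local-schema names via the view."""
        return [self.view[f] for f in global_fields]

    def answer(self, global_fields):
        """Answer a mediator query and wrap the result as an XML string."""
        local_fields = self.rewrite(global_fields)
        root = ET.Element("result")
        for row in self.source_rows:
            record = ET.SubElement(root, "record")
            for g_name, l_name in zip(global_fields, local_fields):
                ET.SubElement(record, g_name).text = str(row[l_name])
        return ET.tostring(root, encoding="unicode")

    # Data extract subsystem (simplified)
    def record_change(self, row):
        """Apply a source update; once the number of changes exceeds the
        threshold, return a full XML extract for the mediator's cache."""
        self.source_rows.append(row)
        self.pending_changes += 1
        if self.pending_changes > self.change_threshold:
            self.pending_changes = 0
            return self.answer(list(self.view))
        return None


# Example usage with made-up data.
wrapper = Wrapper(
    source_rows=[{"emp_name": "Ann", "emp_sal": 10}],
    view={"name": "emp_name", "salary": "emp_sal"},
    change_threshold=1,
)
xml_answer = wrapper.answer(["name"])
```

In this sketch the wrapper keeps no data of its own beyond the view and a change counter, which loosely mirrors the abstract's requirement that the wrapper's own storage stay small and that the data source's storage be used instead.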

Similar resources

Data Integration using Agent based Mediator-Wrapper Architecture

Organizations have to integrate multiple, distributed data sources and repositories to make their business decisions. These data sources include (but are not limited to) databases, object stores, knowledge bases, file systems, digital libraries and legacy systems. Data and/or system integration has become increasingly important for enterprise computing. An agent-based mediator-wrapper software...

Federal Open Agent System Platform

An Open Agent System platform based on High Level Architecture is proposed to support applications involving heterogeneous agents. The basic idea is to develop different wrappers for different agent systems, which are wrapped as federates to join a federation. The platform is based on High Level Architecture, and the advantages of this open standard are naturally inherited, such as syst...

Design of a description language for generating wrapper to collect biological data

Biological data are scattered across various areas in various formats, and they change continuously. Data integration therefore becomes an important issue in providing researchers with dynamic access to data. In the data integration process, the method of extracting heterogeneous data dynamically from the data source is an essential part. Data extraction method using wrapper can provide flexi...

Scientific Data Integration: Wrapping Textual Documents with a Database View Mechanism and an XML Engine

Nowadays scientific data is inevitably digital and stored in a wide variety of formats in heterogeneous systems. Scientists need to access an integrated view of remote or local heterogeneous data sources with advanced data analyzing and visualization tools. Building a digital library for scientific data requires accessing and manipulating data extracted from flat files or documents retrieved from the...

Chapter 3.24 XWRAPComposer: A Multi-Page Data Extraction Service

We present a service-oriented architecture and a set of techniques for developing wrapper code generators, including the methodology of designing an effective wrapper program construction facility and a concrete implementation, called XWRAPComposer. Our wrapper generation framework has two unique design goals. First, we explicitly separate tasks of building wrappers that are specific to a Web s...

Publication date: 2003